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The equation which connects the trace of a rotation matrix and that of its square, and the secular 
equation for a rotation matrix, both of which are direct results of the second orthogonality conditions, 
are solved by purely analytic methods based on the group property and the periodicity property of 
rotation matrices. The point is thus made that the well known formulas for the trace of a rotation matrix 
in terms of the angle of rotation and for the rotation matrix itself in terms of the axis and angle of 
rotation, are closely related to the algebraic properties of rotation matrices. 
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1. Introduction 

In paper I [l] 1 of this series two of the results derived from the second orthogonality condi- 
tions were an equation which connects the trace of an orthogonal transformation matrix with that 
of its square, and the secular equation for a transformation matrix. It was stated that for rigid rota- 
tions these equations could be solved directly by analytic methods based only on very general 
properties of rotation matrices. The details of the solutions were deferred to the present paper. 

For completeness we rewrite here the equations to be solved in this paper and the solutions, 
using R to denote a rotation matrix. The equation which connects tr R and tr(R 2 ) is eq (I 15) with 

p7p=i, 

(trR) 2 -tr (R 2 )=2 tr R. (1) 

The secular equation for R is eq (I 22) with p '/p= 1, 

R 3 -(trR) R 2 +(tr R) R-I=0, (2) 

where / is the identity matrix. The well known expression for tr R in terms of the rotation angle a is 

tr R = 1 + 2 cos a. (3) 

When this is inserted in eq (2), the secular equation becomes 

R 3 -(l + 2 cos a) # 2 +(l + 2 cos a)R-I = 0. (4) 

The solution of eq (4) is 

#(«) = /- A^sin a + /V 2 (l-cosa), (5) 
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where N is given in terms of the components of the axis of rotation by eqs (I 26) and (I 27). The end 
results of this paper are derivations of eq (3) from eq (1) and eq (5) from eq (4). 

The properties of rotation matrices which the derivations in this paper are based on are the 
group property and the periodicity property. The group property follows from the fact that the 
product of two rotation matrices is a matrix which has determinant unity and satisfies the first 
orthogonality conditions, and therefore represents a rotation about some axis. The computation 
of the axis and angle of the product rotation was treated as an example in section 4 of paper II 
[2]. For the purposes of this paper a restricted statement of the group property suffices. This is the 
statement that the product of two matrices representing rotations about the same axis is a matrix 
representing a rotation about that axis by an angle which is the sum of the angles in the component 
rotations. If R(a) represents a rotation about a fixed axis by an angle a, the restricted statement 
of the group property is 

R(a 2 )R(a 1 )=R(a 1 + a 2 ). (6) 

The second property of rotation matrices that we use is the periodicity property. We take as 
the statement of this property the equality of R(2tt) (for any axis) with /?(0), which is the identity 
matrix, 

R(2ir)=U(0)=/, (7) 

together with the requirement that for no angle smaller than 27r is R(a) equal to the identity. 
The second part of the statement is necessary in order that a be the true angle of rotation rather 
than some multiple of it. From eqs (6) and (7) an alternative form of the periodicity property is 

R(a + 27r) = R{a)R(2ir)=R(a)' (8) 

The solutions eqs (3) and (5) are well known and are not the objects of interest in this paper. 
The objects of interest are the derivations of these solutions to be presented in sections 2 and 3 
respectively. The aim in giving these new derivations is to call attention to the fact that the trace 
formula eq (3) and the closed form eq (5) are direct consequences of algebraic properties of rotation 
matrices. Thus, although the derivations given here do not give much geometrical insight into the 
nature of the solutions, they may give a deeper insight. 

2. Solution of the Trace Equation 

We represent the dependence 2 of tr R on a by the function g(a) , 

g(a) =trR(a)' (9) 

We now observe from eq (6) that [R(a)] 2 = R(2a). It then follows from eq (9) that 

tr[R(a)] 2 = trR(2a)=g(2a)' (10) 

We may therefore rewrite eq (1) as 

[g(a)]*-g(2a) = 2g(a) 9 (11) 

which is a functional equation for the unknown function g{a) . A more transparent functional equa- 
tion is obtained by defining the function 

h(a)=i[g(«)-l], (12) 

and rewriting eq (11) in terms of h. The functional equation for h can be written 

[h(a)]* = i[h(2a)+l]. (13) 

Equations (11) and (13) are both independent of the axis of rotation. 



2 The fact that g(a) is independent of the axis of rotation is not immediately apparent from eq (1). It will become apparent as the derivation proceeds. 
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From eq (13) one can begin to see the connection between the result h(a) = cos a, and eq (1). 
The structure of eq (13) is identical to that of the formula for the cosine of a double angle. How- 
ever, cos a is not the only solution to eq (13). We will calculate all of its solutions and then see how 
the geometrically meaningful solution is selected by the periodicity condition. 

The coordinate transformations which underlie rotation matrices are analytic functions of the 
parameters which specify a rotation. 3 It follows from this that h (a) can be expanded in a convergent 
real Taylor series about a = 0. Now we observe from eq (13) that if h{a) is a solution, so also is 
h(— a). That is, h(a) is an even function of a, and when expanded in a power series about a=0 
contains only even powers of a. We therefore write the assumed series in the form 

h(a)= f C 2m a 2m , 

rfco (14) 

where the Cj are constant expansion coefficients. The square of this series is 

[M«)] 2 = J JCwCafl*^. (15) 

m = n = 

We notice that in h 2 , a occurs raised to the power 2 (ra + ra). This suggests that we change the 
variables of summation in eq (15) from m and ra, to a pair of variables one of which is ra + n. Ac- 
cordingly we define new variables j and k by j = ra + n, k = m — n. To find the ranges of j and k 
we note that j is nonnegative and increases in steps of one. Hence its range is all of the nonnegative 
integers. We next observe that for a given value of 7, there are 7 -hi allowed pairs of values of m 
and n in which the sum of the values of m and n in each pair equals the given j value. These pairs 
(m, n) are (0, 7), ( 1 , 7 — 1 ) , (2,y — 2), . . . , {j— 1, 1) , (7, 0)- The values of k corresponding to 
these pairs are k = —j,—j-\-2, — y + 4, . . .,7 — 2,7. We see that k ranges from —7 to +7 in steps 
of two. It is convenient to parametirize k as k = —j+2r where r is an integer taking on the values 
0, 1, 2, . . .,j. Since the subscripts on the C's in eq (15) are 2m and 2ra, we have to express these 
in terms of 7 and r. The results are 2m = 2r, 2n = 2(j — r). The series for h 2 may then be written 



[M«)P=2 £c 2r cv r) 

j=o L r =o 



ol 2 K (16) 



Note that the order of summation in eq (16) cannot be interchanged since the range of r depends 
upon 7. 

If we insert the series eq (14) evaluated for argument 2a, and the series eq (16) for argument 
a into the functional equation (13), we get 

J [ 2 C 2 rC 20 -r)l C& = 1 [J 2VC 2j O&+ ll . (17) 

By equating the coefficients of like powers of a on the two sides of this equation we arrive at a set 
of equations for the determination of the C 2 j. We break this process into the cases j= 0, j= 1, 
and7>2. 

(a) j= 0. This yields an equation for Co, 

CJ=i(Co+l), (18) 

whose solutions are Co = —i, and Co= 1. The former solution can be shown to generate a "series" 
all of whose higher order coefficients are zero. This corresponds to the constant solution h(a) =— i 
of eq (13), or by eq (12) to the trivial solution g(a) = of eq (11). This solution can be ruled out be- 
cause it has arbitrarily small periods in a, whereas the second part of the periodicity condition 
requires that there be no period smaller than 27T. We therefore concentrate on the series generated 
by the value Co=l. 

(b) 7=1. Equating coefficients of a 2 on both sides of eq (17) we get 

C () C 2 = C U (19) 



3 This is equivalent to saying that the rotation group is a Lie group. 
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and setting Co= 1 we get the identity C 2 = z C 2 . This means that C 2 is not determined by eq (13) and 
is therefore a free parameter of the solution. All of the suceeding coefficients will depend on the 
value of C 2 . 

(c) j ^ 2. Equating the coefficients of a 2j on both sides of eq (17) we have 



j 

1 

r = 



2C»Cio-r)-2*-'C w . (20) 



Now since j^ 2 we can write the first and last terms (i.e., those for r=0 and r=j) on the left side 
of eq (20) separately from the rest of the sum, giving 

2CoC 2j + § C 2r C 20 _ r) = 2 2 ^C 2j . (21) 

r=l 

Upon setting Co = 1 this equation can be solved for C 2 j to give 

1 J'- 1 

C V= 2 [ 2 20-D_ i ] E^**^-* (22) 

This is a recursion relation for C 2 j. 

Computation of the first few coefficients from eq (22) gives 

C 4 — -T- 



C 6 = 



6 4! 

C 2 C4 ^q_ (2C 2 ) 3 = (2C 2 ) 3 
15 _ 90~ 720 6! ' 



^ 2C 2 C 8 + t :f = (2C 2 )^ = (2C 2 ) 4 . 
8 2(63) 40,320 8! 

This suggests the assumption that 

„ (2 c 2 y 



^J~ ( 2 j)\ ' (23) 

for ally 2^0, since it is automatically true for j=0 and 1, and has just been demonstrated to be 
true fory=2, 3, and 4. To prove it inductively for ally', we assume that its truth has been dem- 
onstrated for all j up to a certain value, say j = iV, where A^ is an integer equal to or greater than 
two. Then we use eq (22) to compute C 2 j for j=N+ 1. Since all of the subscripts on the C's on the 
right side of eq (22) are less than or equal to N we can use the assumption eq (23) for those C's 
with the result 

(2C 2 ) V+1 N 1 

C2( ^ +1) = 2(2^-l) r ? i (2r)![2(A-r+l)]!* (2 4) 

The sum on the right side of this equation can be evaluated by using the property of the binomial 
coefficients that (ref. [3] ) 

r=o\2r / (25) 

From this property it follows by algebraic manipulation that 

n i _ 2(2^- 1) 



(2r)![2(/V-r+l)]! [2(N+l)]l (26) 
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Insertion of this value for the sum on the right side of eq (24) gives 

(2C 2 )* +1 



'2(N+\)~ 



[2(/V+l)]! 



which is just eq (23) for j=N+l. This then furnishes the inductive step that allows us to assert 
the truth of eq. (23) for all; from its demonstrated truth for/ ^ TV. 

If we now set 2C 2 = K 2 (where K may be real or pure imaginary, depending on whether C 2 is 
positive or negative) and insert eq (23) for the coefficients into the series eq (14), we can recognize 
the resulting series as the Taylor expansion about a = of cosh Ka, 

h(a) = cosh Ka. (27) 

Equation (27) is therefore the most general, regular solution of the "double angle cosine formula" 
eq (13). The lack of uniqueness of the solution is expressed by the arbitrariness of the parameter /£. 
From the definition eq (12) we have 

g(a) = 1 + 2 cosh Ka. (28) 

Since, from the first part of the periodicity property each element of R(a) has period 27T, so does 
g(a). This restricts K to be the imaginary number wu where m is any positive or negative integer. 
The second part of the periodicity property, which requires that g(a) have no period smaller than 
277, selects the particular values m = ±l. The result for either value of m is 

g(a) =tr R(a) = 1 + 2 cos a, 

which is eq (3). 

3. Solution of the Secular Equation 

The group property (6) allows us to replace [R(a)] 3 and [R(a)] 2 'm eq (4) by R{3a) and R (2a). 
respectively. The resulting equation contains no rotation matrix raised to a power higher than 
the first. 

R (3a) -(1 + 2 cos a)R (2a) + (1 + 2 cos a)R (a) -7=0. (29) 

We can therefore try to solve this equation by a Fourier series between and 27T in a, rather 
than by a power series as was done with the trace equation. The advantage of such a Fourier 
series solution is that it automatically incorporates the periodicity property. 

It turns out to be more practical for this problem to use a Fourier series for R (a) in the complex 
exponentials e ima rather than in sin ma and cos ma. We write this series as 

R(a)= JT (m\R)e ima . (30) 

In eq (30) we are using Dirac notation [4] for the Fourier coefficient matrices in order to avoid com- 
plicated expressions in subscripts, in equations for the coefficients to be derived soon. The Fourier 
coefficients are given by the usual formula 



1 f 27r 
(m\R >=— R{oi)e- ima da. 



(31) 



The entire dependence of R(a) on the axis of rotation is contained in these Fourier coefficients. 

We now substitute the series eq (30) evaluated for the appropriate arguments into each of 
the terms of eq (29). We also express the coefficients 1 + 2 cos a as l + e**+e -/a . The resulting 
equation can be written 

V (m\R) \e :ilma — g2ima_|_ e ima_ gi(2m + l)a _|_ e i(m + l)a _ gi(2/w-l)a _j_ e i(m-l)al — J /OO) 

m= -oo 
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Equations for the Fourier coefficients are extracted from eq (32) by the use of the orthonormality 
property of the complex exponentials 



( 



2tt 

e *r-k)a da = 2lT b rk , (33) 



where d r k is the Kronecker delta, equal to one if r=k and equal to zero if r # k. Multiplication 
of eq (32) by e~ ika , and integration of the resulting equation with respect to a from to 2tt yields, 
with the help of eq (33) 

X 

^j(m\R) [8*,3m — 0\,2w+6\, m — 8fr,2ro + l + 8A-,m+l — Ofr,2m-l + Ofc, m _i] = 8fro/. (34) 

Tfl= —oc 

In eq (34) A is any integer, positive, negative, or zero. 

We divide the consideration of eq (34) into three separate cases based on the value of k. 

(a) k = 0. 

For this case Sao— 1. On the left side of eq (34), the first, second, and third terms contribute for 
m = 0. The fifth term contributes for m = — 1 while the seventh term contributes for m=l. The 
fourth and sixth terms do not contribute for any value of m. The equation then reduces to 

(0|«) + <l|/?)+<-l|«)=7. (35) 

(b) k is a positive or negative even integer, but not zero. 

We represent k by 2/ where j is any positive or negative integer. For this caseSA-o— 0. On the 
left side of eq (34) the fourth and sixth terms do not contribute for any value of m. The relation 
between the coefficients for this case is 

(%j\R)-<j\R) + (2j\R) + (2j-l\R) + (2j+l\R) = 0. (36) 

(c) k is a positive or negative odd integer. 

We represent k by 2j-\-l where j is a positive or negative integer, or zero. We again have 8ao— 
in eq (34). On the left side of that equation the sole noncontributing term is the second one. The 
relation between the coefficients which follows from eq (34) for this case is 

<4(2j+ 1) |/?> + <2j-M|/?> - 0|/?> -I- <2j|/?> — <y-h l|/?> + <2(y-h 1) |/2> = 0. (37) 

Evaluation of the coefficients proceeds by setting j equal to the first few of its allowed values 
in eqs (36) and (37), and trying to find a general trend which can then be tested inductively. In 
this process we set equal to zero any coefficient (m\R) in which m is not an integer. Thus, setting 
7 = in eq (37) and equating (i\R) to zero, we find that (2\R) = 0. Setting^ 1 in eqs (36) and (37), 
leads to the results (3|/?) = 0, <4|/?> = 0. The value 7 = -1 leads to <-2|/?> = 0, <— 3[/?>=0. The 
value of 7 = 2 leads to <5|R) = 0, <6|«) = 0, while 7 = -2 leads to <— 4|/?> = 0, <-5|/J)=0. The 
results (7\R) = (8\R) = (-6\R) = <-7|/?> = follow from setting ; = 3 and then -3. 

These results suggest that (m\R) = for all \m\ 2* 2. To test this we follow the trend suggested 
by these results and assume it has been shown from eqs (36) and (37) for all integer values of \j\ 
up to and including some value J greater than one, that 

<2|/0 = <3|1?>= • • • =<2L/+l|/?) = <2(7 + l)|/?> = 0, (38a) 

<-2|/?> = <-3|/?>= • • • =(-2J\R)=(-(2J+l)\R) = 0. ( 38b) 

We then set L/1=y+ 1 in eqs (36) and (37). This means that we have to consider the two values 
j = ± C/+ 1). For the plus sign eqs (36) and (37) yield, respectively 

(f(y+l)|/?)-(/+l|^) + <2(J+l)|/?)-f(2J+l| J R) + (2y + 3|/?) = 0, (39a) 

<K2y + 3)|«) + <2y + 3|/?)-a+l|/J>+(2(y-r-l)|/?)-a + 2| J R)+(2(y + 2)|/i) = 0. (39b) 
For 7*=— (7 + 1) eqs (36) and (37) give, respectively 

<— f U + 1) |/2> — <— (J+ 1) |/2> -+- <— 2(J-h 1) |/?> H- <— (2J + 3)|/?> -f- <— (2JH- 1) |/f> = 0, (40a) 
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(-H2J+l)\R)+(-(2J+i)\R)-(-U+i)\R) + <-2U+i)\R)-{-J\R)+(-2J\R) = 0- (40b) 

In eqs (39) and (40), the coefficients (J+ l\R), (2(J + 1)\R), (2J+ 1\R), (J + 2\R), <- (J + 1)\R), 
<— (2J+l)\R), (—J\R), and (-2J\R), vanish by the assumptions eqs (38) and the fact that 
7>1. The coefficients (HJ + 1)[R), (i(2J + 3)\R), <-f (J+1\R), and <-i(2/+l)|K> all vanish 
because in each case either the index is not an integer, or if an integer, the index falls in the range 
of eqs (38). After striking out the vanishing coefficients, the results of eqs (39) and (40) are 

<2y + 3|/?> = <2C/+l) + l|K> = 0, 

<2C/ + 2)|R> = <2(./ + 1 + 1)|«> = 0, 

<-2(./+l)|/?> = 0, 

<-(2J + 3)|/?> = <-[2C/+l) + l]|/?> = 0. 

These results extend the range of assumptions (38) from \j\=J to \j\=J + l. We have therefore 
shown by induction that the only nonvanishing coefficients in the Fourier series eq (30) are (—117?), 
<0|ft),and (l\R). 

The Fourier expansion of R (a) may now be written 

R{a) = (0\R) +(l\R)e ia +(-l\R)e- ia . (41) 

The coefficients in this expansion obey the restriction (35). We now define the matrix coefficients 
L, M, and N from the equations 

L=(0\R), (42a) 

M=(l\R) + (-\\R), (42b) 

iV — i«l|«>-<-l|R». (42c) 

The restriction (35) expressed in terms of L, M, and N, is 

L + M = I. (43) 

The expansion (41) now takes the form 

R(ol)=I — M — N sin a + M cos a. (44) 

In arriving at the form (44) we have assumed that R(a) obeys the group property. We now have to 
insure that it does by imposing the group property on eq (44). For this purpose it is sufficient to 
require the condition [R(a)] 2 = R(2a). This then implies eq (6) for arbitrary a\ and a 2 . Equating 
R(2a) to the square of the right side of eq (44), and expressing the quantities sin 2 a, cos 2 a, and 
sin a cos a in terms of cos 2a or sin 2a, we have 

I-M + M cos 2a-N sin 2a =2M(I-M) cos a- (2N-MN- NM) sin a-i (MN-\-NM)s'm 2a 

+ i(^ 2 + ^V 2 ) + (/-^) 2 + H^ 2 -A /2 ) cos 2a. (45) 

By equating the coefficients of similar Fourier components on the two sides of this equation, we 
arrive at the relations 

M = -N\ (46a) 

/V 3 = -/V. (46b) 

These relations show that all powers of TV higher than second are equivalent to either /V or iV 2 . 
Equation (44) may now be written 

R(a)=I + N 2 -N sin a-/V 2 cos a. (47) 
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It remains only to determine the matrix N. This is done by imposing both the first and second 
orthogonality conditions on the form (47). The most convenient way to impose the first orthogonality 
conditions is to consider an infinitesimal rotation. This leads to the restriction 

N = -N, 

which states that TV is antisymmetric. Instead of imposing the complete second orthogonality condi- 
tions it is sufficient to impose a consequence of them, namely, the existence of an axis of rotation. 
Therefore if n is a column matrix representing the axis of rotation, imposing the condition 

R(a)n = n, (48) 

places a restriction on TV which guarantees that eq (47) obeys the second orthogonality conditions. 
Equation (48) becomes 

N 2 n — N 2 n cos a— Nn sin a = 0, 

and the restriction which this places on N is 

Nn = 0. (49) 

If the components of n are formally regarded as known, the eqs (49) are three linear homogeneous 
equations for the determination of the three independent elements of N. These elements are linear 
functions of the components of n. The only antisymmetric matrix which can be formed linearly 
from n is €«ja rij. Hence 

N ik = a€ ijk n h (50) 

where a is a constant. It can be determined by imposing the condition (46b) on eq (50). The result 
is that a 2 =l. By choosing a = +l, we formally complete the identification of eq (47) with the 
solution (5). 

The exponential representation (I 28) of R (a) is established by noting from eq (5) (or (47)) that 

£--«• ™ 

Recognizing that R(0) = I, eq (51) can be formally integrated to give 

R(a) = e- Na . (52) 

To establish the equivalence between eqs (47) and (52), one expands eq (52) as a power series in 
TV a. By noting eq (46b), all powers of N which arise are expressible as either N or TV 2 . The resulting 
series is 

n , . y m 7 a 3 a 5 \ /a 2 a 4 a 6 \ 

KW-'-^-si+sr • • -r^ (21-4J+6!- • • •)• < 53 ) 

The coefficient of TV can be recognized as the beginning of the series for sin a while the coefficient 
of N 2 is seen to be the start of the series for 1 — cos a. 

Note added in proof. Since the publication of papers I, II, and III, a reference [Gibbs, J. W., 
and Wilson, E. B., Vector Analysis, sec. 53. (Charles Scribner and Sons, New York, 1909; Dover 
Publications, Inc., New York, I960)], has come to the author's attention in which the second 
orthogonality conditions are derived in a way which is similar to the method used in I, although 
not exhibited in matrix notation. Improper rotations are not taken into account. The relations 
which we have called the second orthogonality conditions were apparently known to mathematicians 
of the nineteenth century although no special name was given to them. The full exploration of the 
content of those equations, which has been the subject of this series of papers, does not appear 
to have been carried out until now. 
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